Speeding - Up Adaptive Heuristic Critic

نویسنده

Eduardo Sanchez

چکیده

Neurocontrol is a crucial area of fundamental research within the neural network eld. Adaptive Heuristic Critic learning is a key algorithm for real time adaptation in neurocontrollers. In this paper we present how an unsupervised neural network model with adaptable structure can be used to speed-up Adaptive Heuristic Critic learning, its FPGA design , and how it adapts the neurocontroller to the state space of the system being controlled.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Beyond Adaptive Critic - Creative Learning for Intelligent Autonomous Mobile Robots

Intelligent industrial and mobile robots may be considered proven technology in structured environments. Teach programming and supervised learning methods permit solutions to a variety of applications. However, we believe that to extend the operation of these machines to more unstructured environments requires a new learning method. Both unsupervised learning and reinforcement learning are pote...

متن کامل

Model-Based Adaptive Critic Designs

Editor’s Summary: This chapter provides an overview of model-based adaptive critic designs, including background, general algorithms, implementations, and comparisons. The authors begin by introducing the mathematical background of model-reference adaptive critic designs. Various ADP designs such as Heuristic Dynamic Programming (HDP), Dual HDP (DHP), Globalized DHP (GDHP), and Action-Dependent...

متن کامل

Stochastic Control Strategies and Adaptive Critic Methods

Adaptive critic methods have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Since they approximate the dynamic programming solutions, they are potentially suitable for learning in noisy, nonlinear and nonstationary environments. In this study, a novel probabilistic dual heuristic programming (DHP) based adaptive critic controller is proposed...

متن کامل

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is mainly due to its fast convergence speed, which is considered to be optimal in practice. In this paper, RLS methods are used to solve reinforcement learning problems, where two new reinforcement learning algorithms using l...

متن کامل

Reinforcement Control via Heuristic Dynamic Programming

Heuristic Dynamic Programming (HDP) is the simplest kind of Adaptive Critic which is a powerful form of reinforcement control 1]. It can be used to maximize or minimize any utility function, such as total energy or trajectory error, of a system over time in a noisy environment. Unlike supervised learning, adaptive critic design does not require the desired control signals be known. Instead, fee...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1997

Speeding - Up Adaptive Heuristic Critic

نویسنده

چکیده

منابع مشابه

Beyond Adaptive Critic - Creative Learning for Intelligent Autonomous Mobile Robots

Model-Based Adaptive Critic Designs

Stochastic Control Strategies and Adaptive Critic Methods

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

Reinforcement Control via Heuristic Dynamic Programming

عنوان ژورنال:

اشتراک گذاری